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An increasing amount of interest is being shown in the study of the 
communication process in complex organizations. To understand any system 
fully it is necessary to know its structure— especially the way in which the 
various components are linked together. However, there has been relatively 
little scientific effort devoted to devising formal methods for the study of 
communication networks in large complex organizations. 

Several methods of analysis have been tried since the earliest publica- 
tions of Festinger, Jacobson, Seashore, and Weiss, all in the early fifties. 
Some of these methods work wall for small organizations; some work well for 
simple organizations. None work well for large complex organizations. 

This limitation has been a major barrier to the analysis of communication 
network structures. 

In the following sections of this paper, some of these early methods 
will be examined briefly and their shortcomings will be discussed. Then 
a new method, one that is promising for very large systems will be presented. 
Finally, a section will be devoted to a discussion of computerized applica- 
tions of this new method. A sample analysis will be worked out using the 
algorithms as they are presented. 

CRITIQUE OF PRESENTLY USED METHODS 

Before discussing the various methods of analysis, it is necessary to 
discuss the goals of network analysis, as applied to communication networks. 

An organization can be thought of as a system. As such, it will have 
components and connections between them. It is convenient to think of 
these components as small groups or cliques , composed of people who talk 
mainly to others in the same group. The connections, then, are those 
people who talk to people in more than one group. These connectors have 
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been called liaison agents if they do not belong mainly to the groups which 
they connect. If they do belong to a group and still talk to people in other 
groups , they are bridge agents . Thus , liaison agents and bridge agents differ 
in that the former do not belong to primary groups, whereas the latter do. 
Similarly, a person who talks to no one is an isolate * Most of the defini- 
tions presented above were drawn from Jacobson, and Seashore, 1951. A 
graphical representation of these terms is shown in Figure 1. 

The preliminary goal of network analysis of complex organizations is to 
identify groups, liaison agents, bridge agents, and isolates. Only after 
this is done can dynamic processes relating these components be effectively 
studied. That is, to study such things as the factors influencing the for- 
mation of cliques, the movement of messages, the effectiveness of liaison 
agents , and so on ; it is necessary first to be able to identify the members 
of cliques and the liaison agents; second, to isolate the relevant variables; 
and finally to study the effects of these variables on the formation and 
growth of communication networks. 

The simplest method for examining the communication structure of an 
organization is by constructing sociograms. For every person in the system, 
a point is put on a sheet of paper. For every instance in which any one 
person reports that he talks to any other, an arrow is drawn, connecting the 
points corresponding to this pair of people, and pointing toward the second 
person’s point. After all arrows have been drawn, it is a simple matter to 
see the structure , if the organization is small and simple and the points 
happen to fall in the right places. For more than fifty people, the task is 
difficult. For more than a few hundred, it is practically impossible. (The 
diagram in Figure 1 is an example of a sociogram.) 
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A method that seems more promising, if only because of its rigid logical 
elegance, involves graph theory. Graph theory uses formal concepts, such as 
points, lines, articulation points, trees, fundamental cut sets, and so on. 

The fact that organizations of people are not structured by formal rules 
seems not to bother protagonistis of graph theory. Since the algebra of 
this theory is not yet sufficiently developed applications of this method 
become laborious when large numbers of people are involved. 

A third method, although not as esthetically pleasing as graph theory, 
has enjoyed much more attention. This is the matrix algebra method (Festmger, 
1949). This method deals with an NxN matrix (binary in its simplest form, 
but sometimes probabilistic) which is defined by the rule: 



The entries in the matrix are often restricted only to those which cor- 
respond to reciprocated contacts. (Figure 2 is the binary matrix for the 
same organization diagrammed in Figure 1.) To determine the structure of the 
network, the matrix is raised to successive powers. People who have non- 
zero entries on the main diagonal are considered to be clique members. This 

method works well, if there is only one clique * 

If there are several cliques, a method for determining how many there 
are and to which one each person belongs must be devised. If any individual 
belongs to mure than one clique, a complicated and laborious process must 
be initiated, in hopes of deciding what should be done with this person. 

This method, also, bogs down with large complex organizations. However, 
most of the processes used are simple and explicitly defined, and thus 
amendable to computerization. This is useless, though, because very large 
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amounts of computer memory are needed to raise the initial matrix to the 
powers demanded and to store the auxiliary matrices needed. Consequently, 
this method is impractical for large complex organizations, without incurring 
unrealistic computer time costs. 

Weiss (1956) discusses a hybrid method, utilizing parts of both graph 
theory and matrix algebra theory. When combined with brute physical manipu- 
lation, his method works for large numbers of people, if enough time is 
available. For this process, the binary matrix described earlier is used 
as a starting point. Rows and columns of this matrix are painstakingly 
manipulated until the non-zero entries appear to cluster around the major 
diagonal (see Figure 3). This is done with the aid of the organization 
chart for the system whose structure is being studied. 

After this re-arrangement, the matrix is often squared, so that liaison 
agents can be identified and isolated. Although -his method usually works, 
the process is extremely time-consuming; and, since it requires many subjec- 
tive decisions to be made, it is not suited to be programmed for a computer. 
(This writer is aware of only three instances in which this method was used. 
They arej Weiss, 1956; Schwartz, 1968; and MacDonald, 1970). 

Other methods have been tried, and have been found wanting for various 
reasons. One of these involved the computerized application of factor 
analysis routines to the original matrix (Wackman, 1967). These routines 
are extremely slow for matrices numbering up to one hundred people; for larger 
groups they are not readily available. Also, the resultant factors and load- 
ings emerging from different rotations are of doubtful correspondence to any 
of the desired components of network structure. 
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It is the author’s belief that the use of any of the methods described 
failed primarily because they were not conceptually suited for the analysis 
of large or complex organizations. The fact that some of them apparently 
work for small simple organizations is unfortunate; for this has encouraged 
further attempts to apply them to larger systems, where they inevitably 
fail, in the next section a theoretical basis for direct, conceptually 

simple process will be presented. 

CONCEPTUAL BASIS FOR AN ALTERNATIVE METHOD 

The theory for the method to be presented in this paper is based on 
the definitions of groups , liaison agents , bridge agents, isolates, and 
membership . Therefore it would be appropriate to review these definitions 

here (Weiss, 1956 ). 

A group or clique is a set of people who talk more with each other than 
with people outside their group. 

A person is a member of a group if over half of his communications are 
with people in that same group. 

A person is a liaison agent if most of his communications are not with 
members of any one group, and if he has at least three contacts. 

A person is a bridge agent if he belongs to a group but talks to at 

least one person in another group. 

A person is an isolate if he talks to no one. 

The goals of network analysis, as discussed in this paper, are to 

identify these components in an organization. 

The data used as a starting point for this analysis are in the form of 
sociometric communication reports. People are asked to list the people they 
talk to. These data are usually modified by the elimination of non-recipro- 
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cated contacts. After collecting the data, there are three major steps 
in the method of analysis to be presented: 1) ordering the communication 
matrix, so that people who talk to each other are near each other in the 
matrix; 2) identifying the liaison individuals; and 3) separating the small 
groups within the organization. (These steps are very similar to those used 
by Weiss in his method. The means of performing them, however, are quite 
different . ) 

ORDERING THE MATRIX 

From the definitions, it follows that the distinguishing behavior of a 
person in specific group is that while he talks mainly to people in that 
group, non members don’t. Similarly, the behavior which makes a group 
member similar to other members of thi- same group is that both talk mainly 
to the same people (see Figure 4). If a formal rule for expressing this 
’’differentness” or ’’sameness” in mathematical terms is used, the structural 
sorting process can be specified exactly enough to be done with only 
objective decisions. 

Another statement that can be made about the relationship between a 
group member and another member of that group is that, since they both be- 
long to the same group, they probably talk to the same people. That is, 
there will be many shared contacts, or two-step links jx .ween them. This 
will usually not be the case for two people not in the same group . 

To put this concept into mathematical terms is relatively simple: 

Each person is assigned a unique subject number. Then for each person in 
the sample, there will be a list or distribution of contacted subject 
numbers (Figure 5a). To see whether or not two people have the same con- 
tactees, the distributions of each person's contactee's subject numbers are 
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compared. If they are similar, the people are probably in the same group. 

A simple way to compare two distributions is to compare the means_ of those 
distributions. Thus, if two people have similar means, they probably have 
similar distributions and therefore they probably have pretty much the same 
contacts, and are thus in the same group (see Figure 5b). 

This procedure works well only when all the people in any one group talk 
only to other people also in that group. However, this is often not the 
case. In many instances, individuals may have contacts with people outside 
their group. This can throw a person’s mean off enough to make it different 
from the means of all the other members of his group. To solve this problem 



and to make the sorting process more efficient and accurate, a weighting 
process is \ised that gives more emphasis to those contacts that are with 
people inside the group than with those outside the group. 

Previously, it was pointed out that there would be more two-step links, 
between people in the same group than with people in different groups. Thus 
the number of two-step links (shared contacts) is suggested as possibly the 



ideal weighting factor, since this number is both a valid discriminator and 
easily calculated (Figure 6a). The mean then would be replaced by a "weighted 
mean" formed by taking the Siam of the products of a person’s contactees* 



subject numbers and the weighting factors, and dividing this sum by the sum 
of the weighting factors, i.e.. 



MEAN-j- = 



N 

I + Z WF 
J=1 



• » 



CN- 



N 



, where WF . . is 



1 + E WFij 

the weighting factor fo’r the I-J contact; CNj is the subject number of 



person J on I’s list of contactees; and N is the number of contactees 
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person I has. (see Figure 6b). If the weighted mean for each person in the 
sample is computed, and this list of means is rank ordered, people with 
similar distributions will be placed near each other. 

Two problems are inherent with this method. One is that it is possible 
for the means of two quite dissimilar distributions to be the same, causing 
people who are not at all like each other to be put near each other after 
rank -ordering the means. 

The other problem is that, for large organizations and random assign- 
ments of subject numbers, there will be high variances for the distributions 
of most people, even with the weighted means. For example: For an organiza- 

tion of 300 people, a person could have contactee numbers ranging anywhere 
from 001 to 300. This high variance implies that the means will not be 
uniquely characteristic of the distributions. In spite of these facts, 
there will be a better placement of people following the rank-ordering than 
before it (see Figure 7). 

If this ordering process is repeated several times, each time replacing 
the old subject number of each person with his rank in the ordered list of 
means, there will be a convergence to a solution for the system. This solu- 
tion will be an ordering of people in the sample so that those who talk to 
each other ire next to each other (Figure 7b). This ordering will be such 
that all the people in any one group will be very close to each other, and 
the variances of the distributions will be minimized. If the data are put 
in the form of a binary matrix, as described earlier, the non -zero entries 
will be clustered about the major diagonal; that is, the distance from the 
diagonal to the non-zero entries will be minimized (see Figure 8). 
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The ordering process is thus composed of these steps: a weighted mean 

is calculated for each person; these means are rank -ordered, resulting in the 
assignment of a rank number to each person; old contactee numbers are re- 
placed by the new rank numbers; a test for convergence is made; and the process 

is either terminated or repeated (see Figure 9). 

The convergence test could be based on any of several criteria. Two 
possible limits are: if the variance for each person's updated distribu- 

tion of contactee numbers (CN's) is calculated, and then all the variances 
are summed, a measure of the total variance is obtained. If this total 
variance no longer decreases from one iteration to the next , the solution 
has converged. A simpler test would be to see if the rank numbers (RN's) 
change from one iteration to the next. When they no longer do, convergence 

is complete. 

It may not be necessary to reach absolute convergence before proceeding 
on to the next step in the analysis. For this reason a convergence criteria 
might be "when less than 5% (10%, 2%) of the ranks change from one iteration 

to the next, stop." 

Identification of Liaison Individuals 

Liaison agents must then be identified. Since, by definition, liaison 
agents talk to people in different groups , their distributions of contaotees 
will be widely dispersed in the rank -ordered, lists. A very characteristic 
aspect of their distributions will be relatively high degrees of variance. 

An important consideration should be given to the use of variance as a 
measure of "liaison-agent-ness”. A person with many contacts may have a 
relatively high variance, even if all of his contacts are with people in one 
group. It would be advantageous, then, to use a measure of variance relative 




19 






* 



pimwe: q 



PLOtM CHl»«T N5* the- process 




o 

ERIC 



MO 

Kofis 

sftiperj 



20 



I 



10 

to the lowest possible variance for this number of contacts. In other words, 
the lowest possible variance for any given individual's distribution, based 
on the number of contacts he has, could be calculated and subtracted from 
his actual variance, giving what might be called a modified variance score 
(MVS). For people whose contacts are restricted to one group, the variance 
will be close to the lowest possible variance for their distributions. Thus, 
their MVSs will be very small. For liaison agents, however, the difference 
between the actu al variance and the smallest possible variance will be large , 
and this will be reflected in the high MVSs. This modification will make it 
much easier to identify potential liaison agents by providing a more sensitive 
measure of the spread of each person's distribution (see Figure 10a). 

Bridge agents will also have higher degrees of variance than other 
people. However, they have primary membership in specific groups, whereas 
liaison agents do not. A method for distinguishing between liaison agents 
and bridge agents, based on this difference is therefore suggested: A bridge 

agent belongs to a group. He talks to several members of his group. Most 
of his contactees talk to the same people he talks to. There are several 

two-step links between him and his contactees. 

A liaison agent, however, talks to people in several different groups, 
none of which he belongs to. There are few two-step links between him and 
his contactees, because they are all in different groups. Therefore, he is 
different from his contactees. 

The same numbers used earlier to weight entries in the means can be 
used here as an index of this "sameness" for a person and his contactees. 

It can be used to identify liaison agents and separate them from bridge 

agents ( see Figure 10b). 
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The following process is thus suggested for identifying liaison agents 
and bridge agents: 

Calculate the MVS for each person’s distribution of CNs. Scan the list 
of MVSs, searching for the largest. Examine this person’s weighting factors 
(WFs), to determine whether he is a liaison agent or a bridge agent. If he 
is a liaison agent, remove him from the system and continue searching for 
more liaison agents. 

After the removal of several liaison agents , the list of CNs may be 
considerably changed. For this reason the ordering process should be 
applied from time to time, say, every 5th (2nd, 10th) liaison agent. When 
no more liaison agents are left, the bridge agents are identified. To make 
the following steps easiest and more clear-cut, whenever a bridge agent is 
identified, a notification is made and the bridge contact is deleted from the 
appropriate lists. This will leave only those people who belong to groups; 
and the groups will effectively be isolated from one another (see Figure 11) . 
Separating the Small Groups 

The next step in the analysis is to separate the small groups from each 
other. If the liaison agents are removed from the lists, it becomes easier 
to separate the groups, because the list will contain only people belonging 
to these groups; and because those people sharing membership in the same 
groups will be placed next to each other in the list. Then, for all the 
people in any one group, the differences between their weighted means will 
be relatively small, compared to the differences between their means and 
those of people in other groups. All that must be done to separate the groups, 
then, is to scan the rank -ordered list, searching for pairs of adjacent people 
whose means are very different from each other. The boundaries are located 
at these points (see Figure 12). 
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This concludes the analysis of the communication structure. The follow- 
ing section is concerned with the reduction of computation time and data 
manipulation, so that the process might be completed more efficiently. 

COMPUTER RELEVANT CONSIDERATIONS 

For most efficient application of this method, the data should be 
structured, rather than in a matrix, which will typically have from fifty 
to ninety percent zeros , in the form of a series of lists . Each person will 
have a list of the subject numbers of his contactees . These lists will be 
referenced many times during the process and for this reason a list pro- 
cessing language might seem appropriate for computerization of the process. 
However, because of the tremendous number of calculations to be made, and 
the relative slowness of these functions in list processing languages, the 

ideal language seems to be one like FORTRAN-IV. 

To cut computation and data manipulation time, a few labor-saving 

devices are suggested: 

Rather than changing every CN in every list to a new RN every time a 
rank-ordering is completed, two parallel lists are set up. One contains the 
original CNs. The other contains the RNs. The latter list is updated after 
each rank-ordering, and is used to reference the lists of old CNs, much like 
an index is used to reference the chapters and pages in a book. 

Many long lists of numbers have to be rank-ordered in the course of the 
analysis. The usual method for rank-ordering a list of numbers is to scan 
the list, searching for the smallest number, then the second-smallest, and so 
on. For a list of n elements this requires up to n 2 /2 comparisons to be made 
The following alternative method could be used for long lists . Assume a list 
of up to 999 elements. The largest number has three digits. If the list is 
sorted into ten piles, by the digit in the one's column, and then again on 
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the ten's column, and finally on the hundred’s column, it will be rank-ordered 
Only 3n operations will have been performed. For a list of 900 elements, 

there is a saving of roughly 38,000 operations. 

The solution obtained on convergence might not be the best possible 
solution. This is because no proof of the unique best solution has to this 
date been formulated. Katz (1958) attempted to form a basis for the proof 
of this best solution, which he called the canonical form of the matrix 
(Forsyth and Katz, 1946). The present author feels that such a search, while 
not irrelevant, should not be considered essential to the creation of a 
workable method of analysis. In trial applications of the method it was 
found that it was not necessary to continue the iterative process of order- 
ing the matrix until absolute convergence was reached. This is because the 
separating methods were sensitive enough to be able to discriminate between 
liaison agents and others without absolute convergence. For this reason, 

an "almost absolute" criterion is suggested. 

There are several points in the a ’lysis which require specification of 

threshold values. Several trial runs are now in progress to determine the 
best values for these levels. It is conceivable that no "best" values will 
be found. This is due to the nature of the data. For example, the level 
of connectivity of the organization, measured by the number of non-zero 
entries in the binary matrix, relative to the highest possible number, would 
appear to have a great influence on the rapidity of convergence, as well as 
the relative numbers of liaison and bridge agents. These and several other 
related problem areas must be studied empirically before specifications can 





be made. 
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